Survey on Outlier Detection in Data Stream

Authors

  • Pooja Thakkar
  • Jay Vala
  • Vishal Prajapati
  • Jiawei Han
  • Micheline Kamber
  • Jian Pei
  • Charu C. Aggarwal
  • QIANG YANG
  • Karanjit Singh
  • Shuchita Upadhyaya
  • Manish Gupta
  • Jing Gao
  • Neeraj Chugh
  • Mitali Chugh
  • Alok Agarwal
  • Fabrizio Angiulli
  • Fabio Fassetti
  • Md. Shiblee Sadik
  • Le Gruenwald
  • Dragoljub Pokrajac
  • Aleksandar Lazarevic
  • Longin Jan Latecki
  • Seyed Hesamodin Karimian
  • Manouchehr Kelarestaghi
  • Sattar Hashemi
Abstract

Data mining provides a way to find hidden and useful knowledge in large amounts of data. Usually we extract information by finding normal trends or distributions in the data, but sometimes a rare event or data object provides information that is of particular interest. Outlier detection is one of the tasks of data mining: it finds abnormal data points or sequences hidden in a dataset. A data stream is an unbounded sequence of data with explicit or implicit temporal context, and it is uncertain and dynamic in nature. Traditional outlier detection techniques for static data, which require the whole dataset for modelling, are not suitable for data streams because the whole stream cannot be stored. Applications of data stream outlier detection include network intrusion detection, web click stream analysis, fraud detection, fault detection in machines, and sensor data analysis. In this paper, we describe several issues in data stream outlier detection and the usual approaches and techniques for finding outliers in a data stream.
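
To make the storage constraint concrete, the following is a minimal sketch of one possible single-pass detector. It is not a method taken from this survey. It keeps only a running count, mean, and sum of squared deviations (Welford's update), so memory use stays constant however long the stream runs. The class name, the threshold of 3 standard deviations, and the warm-up length are all illustrative assumptions.

```python
import math


class StreamingZScoreDetector:
    """Flags points far from the running mean using constant memory.

    Illustrative sketch only: the name, threshold, and warm-up length
    are assumptions, not values from the surveyed literature.
    """

    def __init__(self, threshold=3.0, warmup=10):
        self.threshold = threshold  # cut-off in standard deviations (assumed)
        self.warmup = warmup        # points to observe before flagging (assumed)
        self.n = 0                  # number of points seen so far
        self.mean = 0.0             # running mean
        self.m2 = 0.0               # running sum of squared deviations

    def update(self, x):
        """Score x against the points seen so far, then absorb it."""
        is_outlier = False
        if self.n >= self.warmup:
            std = math.sqrt(self.m2 / (self.n - 1))  # sample standard deviation
            if std > 0 and abs(x - self.mean) > self.threshold * std:
                is_outlier = True

        # Welford's incremental update; the raw point is not stored.
        # Flagged points are absorbed too, which keeps the sketch simple
        # but lets a large outlier inflate the later variance estimate.
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (x - self.mean)
        return is_outlier


if __name__ == "__main__":
    detector = StreamingZScoreDetector()
    readings = [10.0, 10.5, 9.5, 10.2, 9.8, 10.1, 9.9, 10.3, 9.7, 10.0, 50.0, 10.1]
    for value in readings:
        if detector.update(value):
            print("possible outlier:", value)
```

A global running mean like this assumes the stream is roughly stationary. Because a data stream is dynamic, a practical detector would more likely score points against a sliding window or an exponentially decayed summary.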

Similar resources

A Survey on Outlier Detection Techniques in Dynamic Data Stream

Outlier detection has significant importance in the data mining domain. Applications which contain streaming data flow may have many abnormal or outlier data and these applications require efficient outlier detection techniques to detect and analyze these abnormal patterns. Outlier detection is the process of detecting patterns in the data which do not adhere to the normal behavior or data. The...

A Study of Clustering Based Algorithm for Outlier Detection in Data streams

Recently, many researchers have focused on mining data streams and have proposed many techniques and algorithms for them. Data stream mining refers to the process of extracting knowledge from nonstop, fast-growing data records. These techniques include data stream classification, data stream clustering, data stream frequent pattern items, and so on. Data stream clustering techniques are highly helpful to cluster the si...

Comparative Study of Incremental Learning Algorithms in Multidimensional Outlier Detection on Data Stream

Multi-dimensional outlier detection (MOD) over data streams is one of the most significant data stream mining techniques. When multivariate data stream at high speed, outliers must be detected efficiently and accurately. Conventional outlier detection methods are based on observing the full dataset and its statistical distribution. The data is assumed stationary. However, this convention...

Continuous Adaptive Outlier Detection on Distributed Data Streams

In many applications, stream data are too voluminous to be collected centrally and are often transmitted over a distributed network. In this paper, we focus on outlier detection over distributed data streams in real time. First, we formalize the problem of outlier detection using the kernel density estimation technique. Then, we adopt the fading strategy to keep pace with the transie...

Anomaly Detection over Concept Drifting Data Streams

Outlier detection over data streams has attracted attention for many emerging applications, such as network intrusion detection, web click stream and aircraft health anomaly detection. Since the data stream is likely to change over time, it is important to be able to modify the outlier detection model appropriately with the evolution of the stream. Most existing approaches were using incrementa...

Publication date: 2016